Docket 6037-03

AUDIO SYNTHESIS USING DIGITAL 
SAMPLING OF CODED WAVEFORMS 



Field of the Invention 
This invention relates to coding and reproduction of audio signals.

Background of the Invention 

Digital synthesizers or other electronic systems that create music or 
other sound can often be utilized as an electronic musical instrument or 
electronic sound machine. A digital synthesizer is often arranged to accept 
signals from a musician or operator interface and to produce digital output
signals that represent analog signals in the audio frequency range. A 
digital synthesizer is also frequently arranged to accept pre-recorded 
sequences of music events in a Musical Instrument Digital Interface 
(MIDI) format. 

A digital output signal from a digital synthesizer can be converted to
an analog signal and delivered to equipment, such as a loudspeaker, tape
recorder, mixer or other similar device, to reproduce the signals as
intelligible sound. The digital synthesizer can be arranged to provide
output signals that simulate the sounds of one or more conventional
musical instruments. Alternatively, the synthesizer can be arranged to
simulate sounds that would be emitted by a theoretical instrument having
predetermined audio characteristics that are partly or wholly different
from the corresponding characteristics of any conventional, known
instrument.

For previous synthesizers, synthesis of music and other sounds is a
formidable task. A real musical instrument produces a complex blend of
many different frequencies and imparts what is commonly referred to as
"tone color" or "timbre" to the associated sound. For example, a
percussion sound made by a drum or cymbal is an aperiodic sound that
cannot be fully described by any simple mathematical function.
Accordingly, the reproduction or synthesis of digital signals representing
sounds as rich and complex as those of a real musical instrument is a
formidable task for digital signal sampling and processing. Ideally, the
synthesizer should respond to the nuances of a musician's manipulation of
the interface (e.g., a particular musical instrument). For example, a snare
drum has many different audio characteristics when played in different
locations, such as adjacent to the center of the drum membrane, adjacent
to the rim, or on the rim of the drum. Additionally, the audio
characteristics of a percussion instrument will vary with the striking
force and stylistic inflection of the musician.

By providing a large enough waveform database, more realistic and
expressive synthesis of a given group of sounds can be achieved. Many
synthesizers relying on waveform sampling technology are used
extensively in the music and multimedia fields for their ability to create
musical sounds that closely emulate the sound of a musical instrument.

At present, a digital synthesizer system often utilizes a MIDI
interface to control the synthesizer. The MIDI interface creates control
signals or other MIDI control data. The MIDI control data may represent a
music event, such as occurrence of specific musical notes from a particular
musical instrument, such as a piano, drum or horn. However, MIDI has
some fundamental limitations. For example, a synthesizer that uses MIDI
notation has trouble recreating a human voice.

A conventional synthesizer system utilizes a large solid state
memory that stores the digital waveform signals representing each note
played on a particular instrument. The memory can be a static random
access memory (SRAM), a dynamic random access memory (DRAM), a
read only memory (ROM) or some other similar memory with sufficiently
rapid response. When a musician actuates a key or other interface device,
the appropriate waveform is selected, depending upon the key actuated and
upon the intensity and velocity of the key strike. The waveform is then
converted into an analog output signal for sound reproduction. A digital
waveform signal can be combined with other waveform signals that
represent other musical notes being displayed, before the conversion to an
analog output signal.

In this arrangement, the synthesizer, in effect, merely plays back 
digital recordings of individual musical notes or other sounds. Each 
waveform signal is stored as a collection of individual data words, each
representing a single sample of the waveform at a particular time. 

A sequencer is a device for editing musical data, such as MIDI 
events, and converting the musical data into a musical signal in real time. 
Synthesizers are frequently used together with a computer to play a
musical score. In this arrangement, a sequencer reads a MIDI file as an 
ordered sequence. However, it is generally recognized that MIDI files and 
synthesizers are unable to recreate the nuances in a recording or to capture 
the background noises of a live audience. 

Other prior art systems store only one or relatively few waveform
signals representing each musical instrument. These stored waveforms are
then adjusted by digital signal processing or other electronic techniques,
such as nonlinear distortion, to reflect the frequency and amplitude
changes associated with particular musical characteristics, as indicated by
the MIDI control data. For example, the frequency and amplitude of a
sample waveform representing middle C on a piano can be adjusted to
synthesize a different piano note and volume. However, these synthesizers
are unable to (re)produce the complex blends or tone color with high
enough fidelity for the musically trained ear.

In another approach, some systems use digital filtering to adjust the
harmonic content of a particular note. However, these systems require a
large amount of computer power and can be affected by audio quality
degradation.

What is needed is an audio synthesizer system that can reliably
reproduce all types of sounds, including music and the human voice.
Preferably, the system should allow a separation of music or other
information-bearing sound into two or more selected components, where
no single component allows reproduction of sounds resembling the
original sounds. Preferably, the system should allow encryption or other
encoding of one or more of the selected components and should allow
change of parameters affecting the format of one or more of the
components.

Summary of the Invention

These needs are met by the invention, which provides a system for
removing one or more selected segments (referred to collectively as
pre-processing components, or PPCs) of a digital signal stream that
represents an assembly of information-bearing sounds (ISA), including but
not limited to music and one or more human voices, to produce a remainder
data file (referred to as a reduced data file, or ISDF), having one or more
components, after decimation of the ISA by removal of the PPC(s). The
ISDF, as a first sequence, and the assembly of PPCs, as a second sequence,
are preferably processed differently and communicated using different
communication channels. The ISDF is preferably encrypted or otherwise
encoded, using an encryption key EK that may be chosen independently or
may depend upon information contained in the PPC(s). A data supplement
DS, associated with the PPC(s) sequence, contains one or more of the
following information items: location of at least one PPC within the
original ISA; size of one or more of the PPCs; size of one or more
components of the ISDF; separation distance within the ISA of two
consecutive PPCs (if two or more are removed); and at least a portion of
the encryption key EK. The encrypted version of the ISDF, E(ISDF), is
communicated over a first communication channel, and the PPC(s) and the
associated data supplement DS are communicated over a second
communication channel, which may be arranged as a secure
communication channel. Optionally, the PPC(s) and associated data
supplement may also be encrypted for communication. Optionally,
more than one sequence of removed PPCs can be formed, each with an
associated data supplement DS, and communicated separately from the
remaining ISDF component(s).

The invention allows an assembly of information-bearing sounds
ISA, such as music or the sound(s) of one or more human voices, to be
disassembled into N+1 complementary sequences (N>1), where any
collection of N+1-k sequences (1 < k < N), including the ISDF, will not
allow reconstruction of an assembly of sounds resembling the original
ISA.

One useful application of this invention is the provision of a legally
protected ISA (e.g., an ISA covered by copyright) as N+1 sequences,
where any collection of N or fewer sequences cannot be used to reproduce
the original ISA. The invention would, for example, allow distribution of
a subset N+1-k of these N+1 sequences to a prospective authorized listener
at one time (e.g., for storage for possible future use) and distribution of
the remaining k sequences (1 < k < N) at another time, when the listener is
now authorized to listen to the entire ISA.

Another useful application of the invention is the provision or
earlier distribution of a first sequence, representing the bulk or majority
of the sounds needed to accurately reproduce the ISA, where the first
sequence has certain critical data removed. The second sequence,
containing the remainder of the ISA data needed to accurately reproduce
the ISA, is distributed using another channel and/or at another, more
convenient time.

Brief Description of the Drawings 

Figure 1 is a schematic view of apparatus that creates an MP3 file. 

Figures 2A-2C illustrate an original ISA, expressed as a digital 
sequence, and the results of disassembly of the ISA into an ISDF and a 
sequence of PPCs and of association of a DS with a PPC sequence.

Figure 3 is a schematic view of a digital file sampling composer and 
a digital file sampling synthesizer. 

Figures 4, 5 and 6 are flow charts illustrating practice of 
embodiments of the invention. 

Description of Best Modes of the Invention

MP3, which refers to Layer 3 of the MPEG-1 standard, has been
developed and adopted as a useful audio compression standard for music
and other assemblies of information-bearing sounds (ISAs). MP3 is
discussed in some detail in ISO/IEC 11172-3, an international standards
document, incorporated by reference herein. A protocol for creation of an
MP3 file is well known to those of ordinary skill in the art of sound
compression. Many software programs, such as Audio Catalyst, offered by
Xing Technology, which runs on a personal computer, implement creation
of an MP3 file.

Figure 1 illustrates apparatus 11 that may be used to create an MP3
file 13 from an ISA 15. The ISA 15 is received by a digital file sampling
composer (DFSC) 17. The DFSC 17 creates and issues a synthesizer
information file (SIF) 19 and an item specific data file (ISDF) 21. For
decoding, a DFS synthesizer 23 receives and uses the SIF 19 to provide
suitable sequencing for the ISDF 21. The recovered MP3 file 13 is
received and decoded by an MP3 decoder 25, which issues a pulse code
modulation (PCM) waveform 27. The PCM waveform 27 can be played or
otherwise "displayed" through a digital-to-analog converter (DAC) 29,
connected to a speaker 31.

Figures 2A-2C illustrate an ISA 15 as an ordered sequence of bits,
nibbles, bytes or other information units, numbered n = 1, 2, ... , 27, that
are part of an ISA. The ISA (e.g., an MP3 source file) 15 can be any
length. A first subset of these information units, namely "2", "5", "6", "7",
"12", "17", "18", "26", "27", is removed as a sequence of pre-processing
components (PPCs) and is stored in an SIF 19. The remaining data are
stored in an ISDF for encryption. In Figure 2A, the ISA 15 is shown with
a 32-bit header (or trailer, if desired), 27 bytes of data and an ID3 tag.
The ISDF 21, shown in Figure 2B, has a file identification (FID) tag and a
data field with a selected number of bytes of samples. The SIF 19, shown
in Figure 2C, has an FID field, an ID3 tag, a 32-bit header field, a 2-byte
file block (FB) field, a 2-byte song block (SB) field, a variable block size
(BS) field, a 1-bit end-of-selection (EOS) flag and at least a portion of an
encoding/encryption key (EK). The file block FB specifies the number of
blocks of data for the ISDF 21. The song block SB specifies the number of
blocks of data to be removed for the SIF 19, preferably using a file
stripping algorithm. The block size field BS specifies the number of bits
per block, which may be uniform or may be non-uniform. The EOS flag
indicates that the coding process ended on data for the SIF 19 (EOS=1) or
ended on data for the ISDF 21 (EOS=0).
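
The removal illustrated in Figures 2A-2C can be sketched in a few lines of
Python. This is a minimal illustration under stated assumptions, not the
implementation of the invention: the function names split_isa and merge_isa
are hypothetical, each information unit is modeled as an integer equal to
its own index, and the PPC positions are assumed to be known to both sides
(a role the text assigns to the SIF and its associated data).

```python
from typing import List, Sequence, Tuple


def split_isa(units: Sequence[int],
              ppc_positions: Sequence[int]) -> Tuple[List[int], List[int]]:
    """Split an ISA into a remainder (ISDF) and the removed units (PPCs).

    Positions are 1-based, matching the numbering n = 1, 2, ..., 27.
    """
    removed = set(ppc_positions)
    isdf = [u for n, u in enumerate(units, start=1) if n not in removed]
    ppcs = [u for n, u in enumerate(units, start=1) if n in removed]
    return isdf, ppcs


def merge_isa(isdf: Sequence[int], ppcs: Sequence[int],
              ppc_positions: Sequence[int]) -> List[int]:
    """Reposition the PPCs among the ISDF units to rebuild the ISA."""
    removed = set(ppc_positions)
    total = len(isdf) + len(ppcs)
    isdf_iter, ppc_iter = iter(isdf), iter(ppcs)
    return [next(ppc_iter) if n in removed else next(isdf_iter)
            for n in range(1, total + 1)]


# Hypothetical data: 27 units whose values equal their positions.
isa = list(range(1, 28))
positions = [2, 5, 6, 7, 12, 17, 18, 26, 27]
isdf, ppcs = split_isa(isa, positions)
restored = merge_isa(isdf, ppcs, positions)
```

With the nine positions named in the text, the remainder holds 18 units and
the removed sequence holds 9; neither list alone contains the full ISA.
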

Figure 3 illustrates a DFSC 17, which includes a first CPU 41, a
first RAM 43, a first ROM 45, a first user interface 47, a removable
storage unit 49, a data I/O module 51, and first and second communication
channels, 53 and 55. The first interface 47 may include any or all of a
keypad, keyboard, light pen or other data/command entry device, a mouse,
and a display module, such as a monitor, LED or LCD device. The first
and second channels, 53 and 55, may be the same channel. Alternatively,
the first and second channels may be different. For example, the second
channel 55 may be a secure channel that offers at least a reasonable degree
of protection against unauthorized reception of the signals on the second
channel.

Figure 3 also illustrates a DFSS 23, which includes a second CPU
61, a second RAM 63, a second ROM 65, a second user interface 67, a
data I/O module 69, a data synthesizer database 71, a removable storage
device 73, and a DAC 75. The data synthesizer database 71 may include an
ISDF database 71A and/or an SIF database 71B. The removable storage
device 73 may include an ISDF database 73A and/or an SIF database 73B.

Digital samples that are part of the ISDF 21 and/or are part of the
SIF 19 may be transferred from the DFS composer 17 to the DFS
synthesizer 23 using the first communication channel 53 (for ISDF
samples) and/or the second communication channel 55 (for SIF samples)
and/or may be recorded using the removable storage device 73 and
subsequently transferred to the DFS synthesizer 23.

In a preferred embodiment, at least one of the first and second
communication channels, 53 and 55, is the Internet. However, any other
communication channel(s) can be used to communicate ISDF samples
and/or SIF samples, including wireless methods, such as FM subcarrier,
CDPD, transmission using the vertical blanking interval of a television
signal, and other similar wireless methods.

The ISDF database 71A and the SIF database 71B may include
samples from any of several sample sources.

The DFS synthesizer 23 can operate in many different modes,
including the following: (1) synthesize an MP3 file from an ISDF database
71A and from SIF samples streamed from the second communication
channel 55; (2) synthesize an MP3 file from ISDF samples streamed from
the first communication channel 53 and from an SIF database 71B; (3)
synthesize an MP3 file from an ISDF database 73A and from SIF samples
streamed from the second communication channel 55; and (4) synthesize
an MP3 file from an ISDF database 71A and from an SIF database 73B.

Figure 4 is a flow chart illustrating a procedure for practicing audio
synthesis according to the invention. In step 81, an ISA, including an
ordered sequence of units containing digital symbols, is provided. In step
83, the PPCs within the ISA are designated and stripped out or removed
from the remainder ISDF of the ISA. In step 85, a file identification
number FID is assigned to the PPCs and to the ISDF. In step 87, one or
more parameters that characterize the PPCs and/or the ISDF components
are provided and are placed in a data supplement DS that is associated with
the PPCs (preferably), to provide an augmented PPC sequence, PPC+DS,
and/or with the ISDF. In step 89, an encoding key EK (e.g., an encryption
key) is provided, either independently or using one or more parameters
provided by one or more components of the PPCs. Normally, the EK will
have a fixed encoding procedure but optionally will have one or more
parameters that are adjustable according to information supplied by the
PPCs or the ISDF. For example, one or more PPCs may provide initial
values needed to begin the encoding process. In step 91, the ISDF (or an
augmentation ISDF+DS) is encoded (e.g., encrypted), using the encoding
key EK, to produce an encoded version E(ISDF). In step 93 (optional;
usually not necessary), the augmented PPC sequence, PPC+DS, is also
encoded to provide an encoded version E(PPC+DS).

In step 95, the encoded version E(ISDF) is communicated using a
first communication channel, and the augmented PPC sequence, PPC+DS,
is communicated using a second communication channel. The first and
second communication channels may be the same, if desired. Alternatively,
the second communication channel may be a secure channel, to protect a
non-encoded augmented PPC sequence, PPC+DS, from disclosure to
unauthorized entities.

In step 97, the encoded version E(ISDF) and the augmented PPC
sequence, PPC+DS (or E(PPC+DS)), are received or otherwise provided,
and a decoding (e.g., decryption) process is begun. In step 99, the data
supplement DS within the augmented PPC sequence, PPC+DS, is
examined, the PPC and/or ISDF parameters are identified, and
(optionally) part or all of the encoding key EK is recovered. In step 101,
the encoded version E(ISDF) is decoded, to (re)produce the ISDF. In step
103, the PPC components are repositioned among the ISDF components to
recover the original ISA. In step 105 (optional), the ISA is provided for
an ISA repository for playback, display, storage or further processing.

Optionally, the steps 97-103 may be varied by providing a first of
the two sequences, E(ISDF) or PPC+DS, in a first database that is
available at one or more locations to any potential user. However,
possession of the first sequence, or of the second sequence, alone, does not
allow the user to reproduce the sounds of the original ISA. Each of the
first sequence and the second sequence is a decimated version of the
original ISA; and the sounds, if any, reproduced by the first sequence or
by the second sequence alone are, preferably, not intelligible. The second
sequence is withheld until the user has obtained proper authorization (e.g.,
a license) to reproduce the sounds of the original ISA. The first sequence
may, if desired, represent the majority or bulk of the digital signals
needed to reproduce the original ISA so that the remainder (second
sequence) requires far less bandwidth or communication capacity for
delivery than does the original ISA.

The encoding procedure may, for example, incorporate an
encryption process, such as cipher block chaining (CBC), described by
Bruce Schneier, Applied Cryptography, John Wiley & Sons, Second
Edition, 1996, pp. 193-197. In one implementation of CBC, a cleartext
block of a selected size is Exclusively ORed (XORed) or Exclusively
NORed (XNORed) with a key block and with a preceding ciphertext block
to produce a new ciphertext block. The resulting ciphertext block is
XORed or XNORed with a next consecutive cleartext block and with the
next consecutive key block, and so on until a final encrypted block is
generated. The operations XOR and XNOR are each symmetric,
commutative, associative and bilinear so that order is not important within
the ith step. If Pi, Ci and Ki (i = 1, 2, ...) represent cleartext block number
i, ciphertext block number i and key block number i, respectively, a CBC
procedure can be represented mathematically as

     C(i+1) = Pi XNOR Ci XNOR Ki,     (1)

and the initial ciphertext block (i=1) can be specified by one or more of
the PPCs. The choice XNOR, rather than XOR, is made here because of a
useful "inversion" relation

     A XNOR A = I (identity)     (2)

for any block A of binary symbols. With this approach, preferably, the
size of a key block Ki is the same as the size of a cleartext block Pi and is
the same as the size of a ciphertext block Ci. This constraint can be
relaxed somewhat.
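
The chaining in Eq. (1) and the relation in Eq. (2) can be sketched as
follows. This is an illustrative sketch only: the names xnor_block and
chain_encrypt are hypothetical, blocks are modeled as equal-length byte
strings, the identity block I is taken to be the all-ones block, and the
initial block C1 is simply passed in (the text lets one or more PPCs
specify it).

```python
from typing import List, Sequence


def xnor_block(a: bytes, b: bytes) -> bytes:
    """Bitwise XNOR of two equal-length blocks."""
    if len(a) != len(b):
        raise ValueError("blocks must have equal length")
    return bytes((~(x ^ y)) & 0xFF for x, y in zip(a, b))


def chain_encrypt(plain_blocks: Sequence[bytes],
                  key_blocks: Sequence[bytes],
                  initial_block: bytes) -> List[bytes]:
    """Apply Eq. (1): C(i+1) = Pi XNOR Ci XNOR Ki, with C1 = initial_block.

    Returns [C1, C2, ...], one more block than there are cleartext blocks.
    """
    if len(plain_blocks) != len(key_blocks):
        raise ValueError("need one key block per cleartext block")
    cipher = [initial_block]
    for p, k in zip(plain_blocks, key_blocks):
        cipher.append(xnor_block(xnor_block(p, cipher[-1]), k))
    return cipher


# Eq. (2): a block XNORed with itself gives the all-ones block.
block = bytes([0x0F, 0xF0])
all_ones = xnor_block(block, block)

# One chaining step with hypothetical values for P1, K1 and C1.
cipher = chain_encrypt([bytes([0x0F, 0xF0])],
                       [bytes([0xFF, 0xFF])],
                       bytes([0x00, 0x00]))
```

Because XNOR is applied twice in each step, the two complements cancel and
the step gives the same result as a three-way XOR of the same blocks.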

When the encrypted message E(ISDF) is received, the process is
reversed, beginning with an initial ciphertext block Ci' that is determined
using information obtained from the data supplement component(s)
received as part of the encrypted message E(ISDF):

     P(i+1) = C(i+1)' XNOR Ci' XNOR Pi.     (3)

Each of Eqs. (1) and (3) illustrates a sub-encryption cycle (or
sub-decryption cycle) for the encryption or decryption process, where an
initial ciphertext block is optionally provided by a key block Ki with i = 1.
Each sub-encryption cycle, set forth as an example in Eq. (1), will have an
independently specified key component Ki.
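
One way to reverse Eq. (1) can be sketched as follows. Solving Eq. (1) for
the cleartext gives Pi = C(i+1) XNOR Ci XNOR Ki, since a three-term XNOR
equals a three-term XOR of the same blocks. This inversion is derived from
Eq. (1) alone and is an assumption about how a receiver might proceed; it
is not a restatement of Eq. (3). The names xnor_block and chain_decrypt are
hypothetical, and the receiver is assumed to hold every key block Ki and
the initial block C1.

```python
from typing import List, Sequence


def xnor_block(a: bytes, b: bytes) -> bytes:
    """Bitwise XNOR of two equal-length blocks."""
    if len(a) != len(b):
        raise ValueError("blocks must have equal length")
    return bytes((~(x ^ y)) & 0xFF for x, y in zip(a, b))


def chain_decrypt(cipher_blocks: Sequence[bytes],
                  key_blocks: Sequence[bytes]) -> List[bytes]:
    """Recover Pi = C(i+1) XNOR Ci XNOR Ki from [C1, C2, ...] and [K1, ...]."""
    if len(cipher_blocks) != len(key_blocks) + 1:
        raise ValueError("expected one more ciphertext block than key blocks")
    plain = []
    for i, k in enumerate(key_blocks):
        step = xnor_block(cipher_blocks[i + 1], cipher_blocks[i])
        plain.append(xnor_block(step, k))
    return plain


# Round trip with hypothetical blocks: encrypt by Eq. (1), then invert.
keys = [bytes([0xFF, 0xFF]), bytes([0x12, 0x34])]
clear = [bytes([0x0F, 0xF0]), bytes([0xAA, 0x55])]
cipher = [bytes([0x00, 0x00])]  # C1, the initial block
for p, k in zip(clear, keys):
    cipher.append(xnor_block(xnor_block(p, cipher[-1]), k))
recovered = chain_decrypt(cipher, keys)
```

Each cleartext block depends only on two adjacent ciphertext blocks and one
key block, so decryption can start as soon as C1 and C2 are available.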

Figure 5 is a flow chart illustrating a file stripping algorithm that
can be applied to an ISA using an embodiment of the invention. In step
111, a first sequence of units (to hold an ISDF) and a second sequence of
units (an SIF to hold the PPCs) are created, each of unspecified length, and
the same FID is assigned to each sequence. The assembled first and second
sequences will resemble the sequences shown in Figures 1B and 1C. In step
113, a strip count index SC is set equal to the specified number SB of
PPCs. In step 115, the first PPC, or the next PPC in the ordered sequence
of PPCs, is moved from the initial sequence (Figure 1A) to that PPC's
assigned place in the second sequence. A selected portion of this PPC
(which may be the empty set for a particular PPC) is stored in a location,
denoted as TK, for future use. The particular arrangement of bits or other
information units stored in TK is referred to herein as the "content" of
TK, written TK(c). A portion of this content TK(c) can also be used to
determine one or more parameters for the encryption key EK.

In step 117, the strip count index SC is decremented by 1 (SC → SC-1).
In step 119, the system examines the last unit in the particular
component of the second sequence (PPCs) and determines if this last unit is
an end-of-file unit (EOF - 0). If the answer to the query in step 119 is
"yes", the system sets an EOK flag equal to a selected value (e.g., EOK =
1), in step 121, and the procedure terminates, at step 123.

If the answer to the query in step 119 is "no", the system determines
if the strip count index satisfies SC < 0, in step 127. If the answer to the
query in step 127 is "no", the system recycles to step 115 and the
procedure continues. If the answer to the query in step 127 is "yes", the
system initializes the strip count index to FB + TK(c1), in step 129, where
TK(c1) is a selected subset of the content TK(c) held at the location TK.
Use of a combination FB + TK(c1), with TK(c1) variable, allows the
number of sub-cycles covered in the (following) steps 121 through 129 to
be varied or "dithered" within the procedure.

In step 131, the system encrypts the first block, or the next
consecutive block, in the first sequence of units (ISDF). In step 133, the
system decrements the (new) strip count index SC (SC → SC-1). In step
135, the system determines if an EOF is present in this block. If the
answer to the query in step 135 is "yes", the procedure terminates at step
137. If the answer to the query in step 135 is "no", the system determines
if the (new) strip count index satisfies SC < 0, in step 139. If the answer to
the query in step 139 is "no", the system recycles to step 131 and the
procedure continues. If the answer to the query in step 139 is "yes", the
system recycles to step 113 and the procedure continues.
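
The alternation described for Figure 5 can be sketched as a loop that
first moves a run of blocks to the SIF and then passes a dithered run of
blocks to the ISDF. This is a simplified sketch under stated assumptions,
not the flow chart itself: the names strip_file and dither_from are
hypothetical, TK(c1) is assumed to be the low two bits of the first byte of
the most recently moved PPC, each stripping run is assumed to be exactly SB
blocks long, encryption of the ISDF blocks is omitted, and running out of
input stands in for the end-of-file test.

```python
from typing import List, Tuple


def dither_from(ppc: bytes) -> int:
    """Hypothetical TK(c1): low two bits of the PPC's first byte (0 if empty)."""
    return ppc[0] & 0x03 if ppc else 0


def strip_file(blocks: List[bytes], sb: int,
               fb: int) -> Tuple[List[bytes], List[bytes], int]:
    """Alternate SB blocks to the SIF with FB + TK(c1) blocks to the ISDF.

    Returns (isdf_blocks, sif_blocks, end_flag); end_flag is 1 if the input
    ended while PPCs were being stripped and 0 if it ended on ISDF data.
    """
    if sb < 1 or fb < 1:
        raise ValueError("sb and fb must be positive in this sketch")
    isdf: List[bytes] = []
    sif: List[bytes] = []
    pos, end_flag = 0, 0
    while pos < len(blocks):
        # Stripping phase: move up to SB blocks into the SIF.
        tk = b""
        for _ in range(sb):
            if pos >= len(blocks):
                break
            tk = blocks[pos]
            sif.append(tk)
            pos += 1
        if pos >= len(blocks):
            end_flag = 1
            break
        # ISDF phase: the run length is dithered by the last PPC's content.
        for _ in range(fb + dither_from(tk)):
            if pos >= len(blocks):
                break
            isdf.append(blocks[pos])  # a real system would encrypt here
            pos += 1
    return isdf, sif, end_flag


# Hypothetical input: ten one-byte blocks with values 0..9.
blocks = [bytes([i]) for i in range(10)]
isdf, sif, end_flag = strip_file(blocks, sb=1, fb=2)
```

In this example the ISDF run lengths are 2 and then 5, because the second
removed block (value 3) contributes a dither of 3. The run lengths therefore
depend on the removed data and are not visible from the ISDF alone.
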

Figure 6 is a flow chart illustrating a file assembly procedure that
can be used with an embodiment of the invention. In step 141, the system
identifies the data supplement DS and reads the parameters FB, KB,
SISFD, SPPC, EOK and EK. In step 143, the system initializes an
assembly count index, AC = SB. In step 145, the system moves the first
block, or the next consecutive block, of the first sequence of units (PPC)
to its proper location in the original sequence of units (specified by SB,
FBS and KBS) and stores a selected portion of the PPC information
(which may be the empty set) at a designated location TK. In step 147, the
system decrements the assembly count index (AC → AC-1).

In step 149, the system examines the last unit in the particular
component of the second sequence (PPCs) and determines (1) if this last
unit is an end-of-file unit and (2) if EOK = 1. If the answers to the
compound query in step 149 are "yes" and "yes", the procedure
terminates, at step 151. If the answer to one or both parts of the compound
query in step 149 is "no", the system sets the EOK flag equal to another
selected value (e.g., EOK = 0), in step 155. In step 157, the system
determines if the assembly count index satisfies AC < 0.

If the answer to the query in step 157 is "no", the system recycles to
step 145 and the procedure continues. If the answer to the query in step
157 is "yes", the system resets an initial assembly count index, AC = FB +
TK(c1), in step 159, where TK(c1) is a selected subset of the content
TK(c). Again, use of a combination FB + TK(c1), with TK(c1) variable,
allows the number of sub-cycles covered in the (following) steps 161
through 169 to be varied or "dithered" within the procedure.

In step 161, the system decrypts the next consecutive block of a
reassembled original sequence. In step 163, the system decrements the
(new) assembly count index (AC → AC-1). In step 165, the system
examines the last unit in the particular component of the first sequence
(ISDF) and determines (1) if this last unit is an end-of-file unit and (2) if
EOK = 0. If the answers to this compound query in step 165 are "yes" and
"yes", the system terminates the procedure, at step 167. If the answer to
one or both parts of the compound query in step 165 is "no", the system
determines, at step 169, if the assembly count index AC satisfies AC < 0.
If the answer to the query at step 169 is "no", the system recycles to step
161 and the procedure continues. If the answer to the query in step 169 is
"yes", the system recycles to step 143 and the procedure continues.
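
The assembly procedure of Figure 6 can be sketched as an interleaving loop
that draws runs of blocks alternately from the SIF and from the ISDF. This
is a simplified sketch under stated assumptions, not the flow chart itself:
the names assemble_file and dither_from are hypothetical, TK(c1) is assumed
to be the low two bits of the first byte of the most recently placed PPC,
decryption of the ISDF blocks is omitted, and exhaustion of both inputs
stands in for the end-of-file and EOK tests.

```python
from typing import List, Sequence


def dither_from(ppc: bytes) -> int:
    """Hypothetical TK(c1): low two bits of the PPC's first byte (0 if empty)."""
    return ppc[0] & 0x03 if ppc else 0


def assemble_file(isdf: Sequence[bytes], sif: Sequence[bytes],
                  sb: int, fb: int) -> List[bytes]:
    """Interleave SB blocks from the SIF with FB + TK(c1) blocks from the ISDF."""
    if sb < 1 or fb < 1:
        raise ValueError("sb and fb must be positive in this sketch")
    out: List[bytes] = []
    i = j = 0  # i indexes the ISDF blocks, j indexes the SIF blocks
    while i < len(isdf) or j < len(sif):
        # Place up to SB removed blocks (PPCs) back into the output.
        tk = b""
        for _ in range(sb):
            if j >= len(sif):
                break
            tk = sif[j]
            out.append(tk)
            j += 1
        # Follow with a run of ISDF blocks whose length depends on tk.
        for _ in range(fb + dither_from(tk)):
            if i >= len(isdf):
                break
            out.append(isdf[i])  # a real system would decrypt here
            i += 1
    return out


# Hypothetical inputs: one-byte blocks, as if split with sb=1 and fb=2.
sif = [bytes([0]), bytes([3]), bytes([9])]
isdf = [bytes([v]) for v in (1, 2, 4, 5, 6, 7, 8)]
restored = assemble_file(isdf, sif, sb=1, fb=2)
```

Because each run length is recomputed from the PPC just placed, the
assembler needs only SB, FB and the two sequences to restore the order.
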

In another embodiment, no PPCs are removed, and the entire file
becomes an ISDF. No data supplement DS need be included, unless (part
of) the encoding/encryption key EK is to be communicated with the
encoded/encrypted version E(ISDF) or as part of an SIF. Otherwise, the
SIF and the DS may be deleted in this embodiment. With reference to
Figure 4, steps 87, 93, 99 and/or 103 are optionally deleted in this
embodiment. The file stripping algorithm, shown in Figure 5, and the file
assembly algorithm, shown in Figure 6, are not used in this embodiment;
this may also be implemented by formally setting FB = SB = 0 in Figures
5 and 6.

The disclosed procedure relies primarily upon several features to
provide secure communication of an ISA, as two or more sequences,
where no combination of fewer than all the sequences will produce an
assembly of sounds that adequately resembles the original ISA. First,
portions of the message, the PPCs, are removed from the original ISA and
are communicated separately, optionally using a secure channel. Second,
the remainder of the ISA, the ISDF, is encrypted, using an encoding or
encryption key that can vary in length and other characteristics from one
block or component of the ISDF to another, as indicated in the discussion
of Figures 5 and 6. Third, information in one or more of the PPCs can be
used to determine one or more parameters in portions of the encoding or
encryption key EK. Fourth, an augmented PPC sequence PPC+DS can also
be encoded and/or encrypted before communication thereof.

Among other things, an unauthorized (or unlicensed) recipient of
the encoded/encrypted version E(ISDF) and/or of the augmented PPC
sequence PPC+DS would need to be aware of (1) the encoding or
encryption key used to process the encoded/encrypted version E(ISDF)
(and, optionally, used to encrypt the augmented PPC sequence PPC+DS),
(2) the placement and meaning of ISDF, PPC and/or EK information
contained in the data supplement DS, (3) the use of the information in the
data supplement DS to reposition the PPCs within a decoded or decrypted
ISDF, and (4) possible variation of the encoding/encryption key from one
ISDF component to the next, in order to fully decode or decrypt an
intercepted ISDF representing an ISA. The number of parameters and
block-to-block variability incorporated in this system allows one to
distribute or otherwise provide one or more, but fewer than all, of the
sequences formed from the ISA, without making available any version that
can be intelligibly played back. The missing sequence(s) from the ISA can
be distributed to licensed or otherwise authorized "listeners", by another
channel and/or at another time.

What is claimed is:

1. A method of encoding or encrypting data, comprising: 
providing an assembly of information-bearing sounds (ISA); 
removing one or more selected segments of the assembly, to
produce a specified data file;

providing an encoding/encryption key and encoding or encrypting 
the specified data file; and 

communicating the encoded or encrypted specified data file in a first 
selected communication channel and communicating the removed segments 
in a second selected communication channel. 

2. The method of claim 1, further comprising providing a data 
supplement that indicates at least one of: location of at least one of said 
removed segments within said ISA; size of at least one of said removed 
segments within said ISA; number of segments removed; separation 
distance between two consecutive removed segments within said ISA; and a 
selected portion of said encoding/encryption key; and 

communicating said data supplement in said second selected 
communication channel. 

3. The method of claim 1, further comprising providing said 
encoding/encryption key with at least one key parameter that uses 
information from at least one of said removed segments. 

4. The method of claim 1, further comprising selecting said first 
and second communication channels to be the same channel. 

5. The method of claim 1, further comprising providing said second 
channel as a secure communication channel. 

6. The method of claim 1, further comprising concatenating said 
removed segments and said data supplement as a concatenated data file. 

7. The method of claim 6, further comprising encrypting said
specified data file using cipher block chaining of at least one block of said
concatenated data file and at least one encrypted block from said specified
data file.

8. The method of claim 7, further comprising providing said at least
one encoding/encryption parameter for said encoding/encryption key by 
providing a block of said concatenated data file as an initial block for said 
at least one encrypted block of said data. 

9. The method of claim 1, further comprising removing at least first 
and second segments from said data file, where the first segment and the 
second segment have equal length. 

10. The method of claim 1, further comprising removing at least
first and second segments from said data file, where the first segment and 
the second segment have different lengths. 

11. The method of claim 1, further comprising combining said 
removed segments with said specified data file to form a combined data 
file and reproducing the combined data file as an assembly of sounds. 

12. A method of decoding or decrypting data, comprising: 
providing an encoded or encrypted first data file; 

providing a second data file and a data supplement that indicates at 
least one of: an assigned location of at least one designated segment of the 
second data file within a non-coded and non-encrypted version of the first 
data file; size of at least one designated segment of the second data file 
within the non-coded and non-encrypted first data file; number of selected 
segments designated; separation distance of at least two consecutive 
designated segments of the second data file within the non-coded and
non-encrypted first data file; and a selected portion of an encoding/encryption
key used to encode or encrypt the first data file; and 

using the data supplement to decode or decrypt the encoded or 
encrypted first data file and to position at least a first sequence and a 
second sequence, drawn from the second data file, within the first data 
file. 

13. The method of claim 12, further comprising: providing said 
encoded or encrypted first data file on a first communication channel and 
providing said concatenation of said second data file and said data 
supplement on a second communication channel. 

14. The method of claim 13, further comprising selecting said first 
and second communication channels to be the same channel. 

15. The method of claim 13, further comprising providing said 
second channel as a secure communication channel. 

16. The method of claim 19, further comprising determining at least 
one parameter of said encoding/encryption key using information in said 
second data file. 

17. The method of claim 12, further comprising providing said 
encoded or encrypted first data file using cipher block chaining of at least 
one block of said concatenation of said second data file and said data 
supplement and at least one encoded or encrypted block from said first 
data file. 

18. The method of claim 17, further comprising providing at least 
one encoding/encryption key parameter for said encoding/encryption key 
by providing at least one of said first sequence and said second sequence as 
an initial block for said at least one encoded/encrypted block of said data. 

19. The method of claim 12, further comprising providing said
second data file and said data supplement as a concatenated data file. 

20. The method of claim 12, further comprising combining said 
removed segments with said specified data file to form a combined data 
file and reproducing the combined data file as an assembly of sounds. 

21. A method of communicating data, the method comprising: 
providing an assembly of information-bearing sounds as a digital
file of data;

removing one or more selected segments from the data file, to 
produce a specified data file having at least a first block and a second 
block; 

providing an encoding/encryption key having at least a first key 
portion and a second key portion; 

providing a data supplement that indicates at least one of: location of 
at least one of the removed segments within the data file; size of at least 
one of the removed segments within the data file; number of segments 
removed; separation distance between two consecutive removed segments 
within the data file; and at least a portion of the encoding/encryption key; 

encoding or encrypting the first block and the second block of the 
specified data file, using the first portion and the second portion, 
respectively, of the encoding/encryption key; and 

communicating the encoded or encrypted specified data file in a first 
selected communication channel and communicating the removed segments 
and the data supplement in a second selected communication channel. 

Abstract of the Invention

Method and system for audio synthesis of a digital data file 
representing an assembly of information-bearing sounds in digital form. 
One or more spaced apart data segments are designated as key blocks and 
are removed from the original data file. The remainder of the data file is 
encrypted or otherwise encoded and communicated to a selected recipient 
on a first channel. Locations, sizes and separation distances of key blocks 
from each other within the original data file and a selected portion of the 
encoding or encryption key are placed in a data supplement. The removed 
segments and data supplement (optional) are communicated to the selected 
recipient on a second channel and/or at another time. The original data file 
is recovered by using the data supplement information, or using already 
available information, decoding or decrypting the encoded or encrypted 
data file and replacing the removed segments within the data file 
remainder. Neither the remainder data file nor the removed segments plus 
data supplement is sufficient, by itself, to allow reproduction of the 
original data file. Each of the remainder data file and the removed 
segments plus data supplement can be distributed separately and 
subsequently combined when authorization or license to reproduce the 
sounds has been obtained. 
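
One way to read the "key block" role of a removed segment, together with the cipher block chaining recited in the claims, is that a removed segment serves as the initial block of the chain. The sketch below illustrates only that chaining structure. The block size of 16 bytes, the function names, and the per-block transform (an XOR with a pad derived from the key, which is not secure) are all assumptions made for illustration; a real system would substitute an actual block cipher.

```python
import hashlib

BLOCK = 16  # block size in bytes (an assumption; the source does not fix one)


def _xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def _toy_encrypt_block(block: bytes, key: bytes) -> bytes:
    """Stand-in for a real block cipher: XOR with a key-derived pad. NOT secure."""
    pad = hashlib.sha256(key).digest()[:BLOCK]
    return _xor(block, pad)


_toy_decrypt_block = _toy_encrypt_block  # XOR with the same pad undoes itself


def cbc_encrypt(remainder: bytes, key: bytes, key_block: bytes) -> bytes:
    """Chain the blocks of remainder, starting from a removed segment (key_block)."""
    assert len(key_block) == BLOCK and len(remainder) % BLOCK == 0
    prev, out = key_block, bytearray()
    for i in range(0, len(remainder), BLOCK):
        # Each plaintext block is mixed with the previous ciphertext block.
        prev = _toy_encrypt_block(_xor(remainder[i:i + BLOCK], prev), key)
        out += prev
    return bytes(out)


def cbc_decrypt(ciphertext: bytes, key: bytes, key_block: bytes) -> bytes:
    """Invert cbc_encrypt; needs the same key and the same initial block."""
    prev, out = key_block, bytearray()
    for i in range(0, len(ciphertext), BLOCK):
        block = ciphertext[i:i + BLOCK]
        out += _xor(_toy_decrypt_block(block, key), prev)
        prev = block
    return bytes(out)
```

In plain cipher block chaining of this form, a recipient who holds the key but lacks the initial block recovers every block except the first, so the protection in this sketch still rests on the key and on the removed segments being withheld from the data file itself.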